Picture for Yong Jae Lee

Yong Jae Lee

Latent Recurrent Transformer: Architecture Exploration, Training Strategies, and Scaling Behavior

Add code
May 26, 2026
Viaarxiv icon

Your Embedding Model is SMARTer Than You Think

Add code
May 24, 2026
Viaarxiv icon

From Plans to Pixels: Learning to Plan and Orchestrate for Open-Ended Image Editing

Add code
May 14, 2026
Viaarxiv icon

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Add code
Apr 14, 2026
Viaarxiv icon

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

Add code
Mar 26, 2026
Viaarxiv icon

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs

Add code
Mar 18, 2026
Viaarxiv icon

Spatially Grounded Long-Horizon Task Planning in the Wild

Add code
Mar 13, 2026
Viaarxiv icon

Reasoning-Augmented Representations for Multimodal Retrieval

Add code
Feb 06, 2026
Viaarxiv icon

Agentic Very Long Video Understanding

Add code
Jan 26, 2026
Viaarxiv icon

VideoWeave: A Data-Centric Approach for Efficient Video Understanding

Add code
Jan 09, 2026
Viaarxiv icon